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The factorial validity of the Maslacn Burnout 
Inventory (MBI) was studied for 2,931 Canadian teachers (48.2% males 
and 51.8% females) as a single professional group and for subsamples 
of this group (1,159 elementary school teachers, 388 intermediate 
school teachers, and 1,384 secondary school teachers). Study 
participants were full-time teachers from two large metropolitan 
areas in central Canada. Tho subjects represented about 42% of a 
sample originally sent the study questionnaire. The best-fitting 
factorial niodel was cross-^validated for samples made by splitting the 
original group. Exploratory factor analysis and confirmatory factor 
analysis (CFA) were used to validate the hypothesized three-factor 
structure. CFA was subsequently used to: (1) identify aberrant scale 
items; (2) determine, via double cross-validation procedures, the 
model having the greatest predictive validity; and (3) test for the 
invariance of measurement and structural relations across calibration 
and validation samples. Although findings strongly support a 
three-factor structure, the deletion of four ittms (numbers 2, 11, 
12, and 16) yielded a remarkably improved and psychologically sound 
instrument. Compared with data for the elementary school and 
secondary school teachers, model fit for intermediate school teachers 
was more problematic, although consistent with past research. A 
57-jtem list of references is included, and an appendix gives a 
breakdown of MBI items on factors of burnout (emotional exhauiition, 
depersonalization, and personal accomplishment). Four tables present 
study data. (SLD) 
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Abs t rac t 

Purposes of the study were: (a) to test for the factorial 
validity of the Maslach Burnout Inventory (MBI), for teachers as 
a single professional group (£=2,931), and for subsamples of 
elementary (ji=l,159). Intermediate (ji«388), and secondary 
(ji«l,384) teachers, and (b) to cross-validate the best-fitting 
factorial model for a second independent sample. Exploratory, as 
well as confirmatory factor analyses (CPAs) were used to validate 
the hypothesized 3-factor structure; CFA was subsequently used to 
(a) identify aberrant scale items, (b) determine, via double 
cross-validation procedurer the model having the greatest 
predictive validity, and (c) test for the invariance of 
measurement and structural relations across calibration and 
validat io n sam])les<, Although findings strongly supported a 
3-factor structure, the deletion of four items (#2, #11, #12, 
#16) yielded a remarkably improved and psy chomet r i c a 1 1 y sound 
instrument. Compared with the elementary and secondary levels, 
model fit for intermediate teachers was more problematic, albeit 
consistent with past research- This study is the most rigorous to 
date in testing the factorial validity of the MBI; findings are 
expected to contribute importantly to fut^^re honing of the 
instrument for use with educators. 
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The Maslach Burnout Inventory: A Cross--val idated Reassessment 
of Factorial Validity for Elementary, Intermediate, and 

Secondary Teachers 

The Maslach Burnout Inventory (MBI; Maslach & Jackson, 
1981a, 1986) is undoubtedly the most widely used measure of 
occupational burnout, the inability to function effectively in 
one's job as a consequence of prolonged and extensive 
job-related stress* To date, however, assessment of the MBI^s 
factorial validity has been limited to exploratory techniques 
that have included both factor and principal components 
analyses; these procedures, however, lack the statistical rigor 
to directly te hypotheses related to factorial structure. 
Moreover, validity studies of the MBI, as it bears on the 
teaching profession, have most often been conducted on samples 
that combined teachers representing different levels of the 
educational system (e.g., Gold, 1984; Iwanlcki & Schwab, 1981). 
Finally, cross-validation of the MBI has not been tested 
statistically in a rigorous simultaneous analysis of data. The 
primary purposes of the present study, then, were twofold; (a) 
to test the factorial validity of the MBI both for teachers as 
a single professional group, and for elementary, intermediate, 
and secondary teachers as separate professional groups, and (b) 
to cross-validate the factorial structure of the MBI across a 
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second independent sample for each of these groups; both sets 
of analyses used a confirmatory factor analytic approach to the 
data based on the analysis of covariance structures. 

The seminal research of Maslach and colleagues was the 
first of an empirical nature to investigate the phenomenon of 
burnout (for an historical summary, see Maslach, 1981a; Maslach 
& Jackson, 1984). Their findings were consistent in supporting 
a multidimensional construct comprising ^hree related, yet 
independent components: (a) emotional exhaustion feelings of 
fatigue that develop as one's emotional energies become 
drained, (b) depersonalization the development of negative 
and uncaring attitudes toward others, and (c) reduced personal 
accomplishment a deterioration of sel f -competence , and 
dissatisfaction with one's achievements. 

These three elements of burnout have been empirically 
validated for elementary, intermediate, and secondary school 
teachers (Beck & Gargiulo, 1983; Belcastro, Gold, & Hays, 1983; 
Byrne, 1990a; Fimian & Blanton, 1987; Friesen & Sarros, 1989; 
Gold, 1984; Green & Walkey, 1988; Iwanicki & Schwab, 1981; 
Jackson, Schwab, & Schuler, 1986; Schwab ^ Iwanicki, 1982a, 
1982b), as well as for administrative educators (Friesen & 
Sarros, 1989) and teacher trainees (Fimian & Blanton, 1987). 
Teachers exhibit signs of emotir jal exhaustion when they 
perceive themselves as unable to give of themselves to 
students, as they did earlier in their careers. 
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Depersonalization is evidenced when teachers develop negative, 
cynical and sometimes callous attitudes towards students, 
parents and colleagues. Finally, teachers reflect feelings of 
reduced personal accomplishment when they perceive themselves 
as ineffective in helping students to learn, and in fulfilling 
other school responsibili t ies • Overall, teachers who fall 
victim to burnout are likely to be less sympathetic toward 
students, have a lower tolerance for classroom disruption, be 
less apt to prepare adequately for class, and feel less 
committed and dedicated to their work (Farber & Miller, 1981). 

The development of the MBI was based on samples of workers 
from a wide range of human service organizations, including 
nurses, teachers, police officers, physicians, social workers, 
psychologists, psychiatrists, and lawyers (Maslach & Jackson, 
1981a, 1986). The instrument comprises 22 items designed to 
measure both the frequency and intensity of burnout as 
represented by three components emotional exhaustion (EE), 
depersonalization (DP), and personal accomplishment (PA), 
Findings of high correlations between the frequency and 
intensity dimensions of burnout for teachers have led to 
recommendations that only the former be used with this 
professional group (Gold, 1984; Iwanicki & Schwab, 1981). (The 
suggestion, however, has been followed for other occupations as 
well; see e.g.. Firth, Mclntee, McKeown, & Britton, 1985; 
Golembiewski , Munzenrider, & Carter, 1983; Green & Walkey, 
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1988. ) 

As a consequence, In large part, ot Schwab and Iwanicki's 
(1982a, 1982b; Iwanicki & Schwab, 1981) work related to burnout 
among teachers, Maslach and Jackson (1986), in collaboration 
with Schwab, developed the Educators' Survey (MBI Form Ed), a 
version of the MBI specifically designed for use vit.h teachers. 
The MBI Form Ed measures the same three factors of burnout as 
the original version of the MBI; the only difference between 
the two versions lies in the modified wording of certain items 
to make them more appropriate to a teacher's work environment. 
Specifically, the generic terra "recipient", used in the MBI to 
refer to clients, has been replaced by the term "students". 

Most EFAs of the MBI have yielded three moderately to 
well-defined burnout factors representing EE, DP, and PA for 
various human service professionals in general (see Green & 
Walkey, 1988; Maslach & Jackson, 1981b), and for teachers in 
particular (Beck & Gargiulo, 1983; Belcastro et al . , 1983; 
Byrne, 1990; Gold, 1984). Some researchers, however, have 
concluded a 2-factor (Brookings, Bolton, Brown, & McEvoy, 
1985), or a 4-factor (Firth et al . , 1985; Powers & Gose, 1986) 
structure for human service workers, and a 4-factor structure 
for educators (Iwanicki & Schwab, 1981). In an EFA study of 
intermediate, secondary, and post secondary educators, Byrne 
(1990a) critically examined data within the framework of 2-, 
3-, and 4-factor solutions. Although results partially 
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replicated those supporting alternate factorial structu.es, 
they were inconsistent across the three groups, and yielded 2- 
and 4-factor solutions that were exemplary of under- and 
overf actoring , respectively (see Gorsuch, 1983: Walkey, 1983, 
1985). Byrne therefore concluded a 3-f actor solution to be 
optimal both statistically and substantively. 

Based on a substantial amount of evidence, it now seems 
abundantly clear that the MBI is most adequately defined by a 
3-factor solution. A review of construct validity research, 
however, suggests the need for possible improvement to item 
content, at least p.s it relates to teachers. In this regard, a 
number of researchers have noted problematic loading patterns 
for five particular items (#6, #11, #12, #16, #20). Items 6, 
16, and 20, designed to measure EE, have been found either to 
load incorrectly or to cross-load onto the DP factor (Belcastro 
et al., 1983; Byrne, 1990a; Fimian & Blanton, 1987; 
Golembiewski et al., 1983; Green & Walkey, 1988), and Items 11 
and 12, measuring DP and PA, respectively, to cross-load onto 
the EE factor (Byrne, 1990a; Golembiewski et al., 1983; Green & 
Walkey, 1988; Powers & Gose, 1986). Interestingly, these same 
factor loading patterns are consistent with those tabled by 
Maslach and Jackson (1986) and Gold (198A), albeit the Issue of 
cross-loadings was not addressed. In a further confirmatory 
factor analytic (CFA) study of her data, Byrne (1990a) 
substantiated the problematic fit of these five items for 
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intermediate, secondary, and university educators; a more 
adequate 3-factor solution was attained by deleting Items 12, 
16 and 20, and by permitting Items 6 and 11 to cross-load for 
the intermediate and university groups, respectively. 

Additional psychometric analyses, have shown the EE, DP, 
and PA subscales of the MBI to demonstrate strong evidence of 
(a) internal consistency reliability, with alpha coefficients 
ranging from .52 to .91 (mean a » .77) (Beck & Gargiulo, 1981; 
Belcastro et al., 1983; Flralan & Blanton, 1987; Golembiewski et 
al., 1983; Iwanicki & Schwab, 1981; Leiter & Maslach, 1988; 
Maslach & Jackson, 1981b), (b) test-retest reliability, with 
coefficients based on a 2 to 4-week interval ranging from .60 
to .82 (mean r - .74) (Maslach & Jackson, 1981b), (c) 
convergent validity with external criteria including personal 
experience (observations), dimensions of job experience, and 
personal outcomes (Maslach & Jackson, 1981b, 1986), and (d) 
discriminant validity as evidenced by low and nonsignificant 
correlations between MBI scores, and job satisfaction and 
social desirability (Jackson et al., 1986; Maslach & Jackson, 
1981b, 1986). 

Taken together, these findings provide good support for the 
MBI as a potentially reliable and valid measure of teacher 
burnout. However, given numerous replicated findings related to 
five possibly aberrant items (#6, #11, #12, #16, #20), it seems 
apparent that still further construct validity research is 
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needed in order to more fully establish the psychometric 
soundness of the instrument. To this end, it is important to 
note several limitations in previous validity work bearing on 
the MBI. First, except for one study (Byrne, 1990a), the MBI 
has not been validated separately for elementary, intermediate, 
or secondary teachers; studies, to date, have sampled teachers 
from different teaching levels and assumed representation from 
a single population* However, given that the work environments 
of elementary, middle (i.e., junior high), and secondary 
schools can differ substantially in terms of both academic and 
nonacademic responsibilities imposed, the possibility of 
differential perceptions of item content and/or relations among 
burnout facets seems worthy of investigation. Second, despite 
known correlations among the three dimensions of burnout, all 
but two factor analytic studies (Brookings et al-, 1985; Byrne, 
1990a) have used oblique rotational procedures in the search 
for simple structure. Third, except for one study (Byrne, 
1990a), factorial validity of the MBI has been examined using 
only an exploratory approach, based for the most part, on 
principle components analyses. Now well documented, however, 
are several deficiencies associated with such exploratory 
procedures in general (see e.g., Bollen, 1989; Fornell, 1983; 
Long, 1983; Marsh & Hocevar, 1985), and components analysis in 
particular (see e.g., Borgatta, Kercher, & Stull, 1986j 
Gorsuch, 1990; Hubbard & Allen, 1987; Snook & Gorsuch, 1989); 
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the latter having been shown to yield highly inflated factor 
loadings and, thus, mislead ingly clear factor structures. 
Indeed, CFA is now widely accepted as the more powerful test of 
factorial validity. Finally, the factorial structure of the MBI 
has not been cross- val idated with an independent sample using 
simultaneous data analyses. 

The present study addressed these concerns by (a) testing 
the factorial validity of the MBI for teachers, both as a 
single professional group, and as separate elementary, 
intermediate, and secondary teacher groups, (b) using a CFA 
approach to the validation inquiry based on the analysis of 
covariance structures, (c) conducting double cross-validation 
procedures to determine the factorial model having the greatest 
predictive power for both the single and separate groups of 
teachers, and (d) based on findings determined in (c), testing 
for the invariance of factorial measurement and structure 
across validation samples in a simultaneous analysis of the 
data • 

Method 

Sample and Procedure 

Participants in the study were full-time elementary, 
intermediate, and secondary school teachers from two large 
metropolitan areas in Central Canada. Using stratified 
proportional sampling procedures, a total of 7,000 elementary/ 
intermediate (n=360O) and secondary (n«3A00) teachers were 
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randomly selected from the membership roster of the Ontario 
Teachers' Federation; this represented approximately 30% of the 
teacher population across the two urban centers. A 46% response 
rate resulted in questionnaires being received from 3188 
teachers. Listwise deletion of missing data ultimately yielded 
a final sample of 2,931 teachers (elementary n^ll59; 
intermediate na388; secondary n«=1384)- 

The sample in its entirety comprised a total of 48.2% males 
and 51.8% females, most of whom (50.9%) were 40-49 years of 
age, married with children (59.3%), and had been teaching, or 
more than 13 years (77,8%); of the latter, 44.4% had over 21 
years experience. The majority of respondents (70.0%) were 
fulltime classroom teachers, 13.0% had combined classroom and 
administrative duties, 8,5% were fulltime administrators 
(principals, vice principals), the remainder were guidance 
counselors. Finally, while most of these educators (70.1%) 
taught students registered in the regular academic track, 17.2% 
taught those in the low academic track; thi remainder taught 
either special education (10.7%), or gifted (2.6%) students. 

Sample composition by teaching level deviated only with 
respect to gender, for the Intermediate level. Elementary 
teachers were composed of 42% males and 58% females; most 
(51.3%) were in the 40-49 age range, were married with children 
(57.7%), had been teaching for more than 13 years (79.2%), were 
fulltime classroom teachers (75.8%), and taught students in the 
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regular acaderaic stream (77.3%). Intermediate teachers 
comprised 67.6% males and 32.4% females. Again^ most (55%) were 
between the ages of 40 and 49, were married with children 
(65.4%), had been teaching longer than 13 years (79.1%), were 
fulltirae classroom teachers (75.6%), and taught regular 
academic students (74.2%). Finally, of the secondary school 
teachers, 47.4% were male and 52.6% were female; 49*3% were 
40-49 years of age, most were married with children (58. 4%), 
had been teaching more than 13 years (76.1%), were fulltime 
classroom teachers (64.2%), and taught students in the regular 
academic stream (63.^%). The high school sample differed 
slightly from the number of teachers whose workload included 
both classroom and administrative duties (17.6%), and who 
taught low-track students (29.3%). 

The present study was based on data drawn from a larger 
study of teacher burnout (Byrne, 1990b) and, thus, the MBI was 
only one of a battery of ten measuring instruments mailed to 
randomly selected teachers; additional materials included a 
demographic response sheet, and explanations regarding che 
general purpose of the study, procedures for completing the 
questionnaire, and guaranteed anonymity and confidentiality. 
Followup reminder letters were sent to all teachers 
approximately four weeks following the initial mailing. 

Given mean univariate skewness and kurtosis values within 
the approximate range from -1.0 to +1.0 (see Muthen & Kaplan, 



- ' " ■ " 

* t 

MBI Cross-validatio 

13 

1985), the data were considered to approximate a normal 
distribution. Skewness ranged from -2,015 to 2.036 (M«-.037) 
for the total sample, from -2.228 to 2,722 (M«,075) for 
elementary teachers, from --2. 084 to 1.699 (Ma-.123) for 
intermediate teacher- and from -1.982 to 1.760 (M«.008) for 
secondary teachers. Kurtosis ranged from -1.069 to 5.056 
(M=.998) for the total sample, from -1.146 to 7.928 (M=1.560) 
for elementary teachers, from -1.C68 to 5.119 (M=».809) for 
intermediate teachers, and from -1.067 to 5.107 (M=.850) for 
secondary teachers. 

Instrumentation 

The MBI (Form Ed; Maslach, Jackson, & Schwab, 1986) was 
used to measure burnout for teachers at the elementary, 
intermediate, and secondary levels. The 22-item instrument is 
structured on a 7-point fully anchored scale ranging from 0 
"feeling has never been experienced", to 6 "feeling is 
experienced daily". The EE, DP, and PA subscales comprise nine, 
five, and eight items, respectively; all items are listed in 
the Appendix. 

Analysis of the Data 

For purposes of cross-validation, the total sample as well 
as subsamples representing elementary, intermediate, and 
secondary teachers were randomly split into two (see Bollen, 
1989; Cudeck & Browne, 1983); for each of these four groups. 
Sample A comprised the calibration sample, and Sample B the 
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validat'on sample. Data were subsequently analyzed in five 
stages. First, for each calibration sample, CFA procedures 
using the LISREL VI computer program (Joreskog & Sorbora, 1985) 
were conducted to test the hypoth 'ized 3-factor structure 
underlying the MBI. Second, given findings of less than 
adequate CFA model fit, analyses proceeded in an exploratory 
mode using both EFA and post hoc CFA procedures in order to 
identify parameters contributing to model misfit. Third, 
alternative models were subsequently specified and tested for 
their goodness-of -f i t to a 3-factor structure* Fourth, 
best-fitting models were tested across calibration and 
validation samples using double cross-validation procedures to 
determine the one yielding the highest degree of predictive 
validity (see Cudeck & Browne, 1983; Bagozzi & Yl, 1988). 
Finally, the MBI model yielding the highest predictive validity 
was further cross--val idated under a set of three increasingly 
restrictive conditions; MBI equivalence across calibration and 
validation samples was tested in a cumulative manner with 
respect to (a) number of factors, (b) item measurements, and 
(c) underlying theoretical structure. 

Although, historically, the chi square ( ) likelihood 
ratio has been used to judge the extent to which a proposed 
covariance structure model fits the observed data, the 
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sensitivity of the X statistic to sample size is now well 
known; several alternative nons tat ist ical (i.e., practical) 
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indices of fit have subsequently been proposed (for a recent 
review, see Marsh, Balls, & McDonald, 1988). As recommended by 
a number of researchers (e.g., Bagozzi & Yi, 1988; Fornell, 
1983; Joreskog, 1982; Long, 1983; Marsh & Hocevar, 1985), 
assessment of fit in the present study was based on multiple 
criteria that reflected statistical, theoretical, and oractical 
considerations. Furthermore, parsimony, as well as degree of 
fit were taken into account (see Bentler & Mooijaart, 1989; 
Mulaik, James, Alstine, Benn?tt, Lind, ^ Stilwell, 1989). As 
such, evaluation of model fit was based on (a) the likelihood 
ratio, (b) the Bentler (1990) revised normed comparative fit 
index (CFI), (c) the parsimonious normed comparative fit index 
(PCFI; Mulaik et al., 1989), (d) the adjusted goodness-of - f i t 
index (AGFI), T-values (parameter estimates relative to their 
standard errors), and modification indices (Mis), all provided 
by the LISRF>L program, and (e) the substantive meani ngf ulness 
of the model (see MacCallum, 1986; Suyapa, Silvia, & MacCallum, 
1988; Walkey, 1983). 

The CFI represents a modified version of the Bentler and 
Bonett (1980) normed fit index that has been corrected for 
sample size dependency (Bentler, 1990); it is included here 
because of widespread use of the former index. Plausible models 
yield CFI values >.90, indicating a reasonably adequate 
approximation to the observed data. The PCFI is based on the 
CFI; the originally proposed index (Mulaik et al., 1989) has 

ERIC ' 



MBI Cross-validation 

16 



been adapted here for use with the CFI* Because this index 
takes into account goodness-of i t , as well as the parsimony of 
the model, its value is often much lower than what is generally 
perceived as "acceptable" on the basis of norraed ind?ces of 
fit. In fact, Mulaik and associates (1989) note that 
goodness-of-f it indices in the *90's accompanied by 
parsimonious fit indices in the . 50 * s are not unexpected. For 
purposes of the present study, however, PCFI values >.80 were 
considered indicative of an adequately specified model (see 
e.g., Mulaik et al . , 1989) . 

Results and Discussion 
Confirmatory Factor Analyses 

The CFA model in the present study hypothesized a priori 
that: (a) responses to the MBI could be explained by three 
factors, (b) each item would have a non-zero loading on the 
burnout factor it was designed to measure, and zero loadings on 
all other factors, (c) the three factors would be correlated 
and, (d) the uniqueness terms for the item variables would be 
uncorrelated . (The term "uniqueness" is used here in the factor 
analytic sense to mean a composite of specific and random 
measurement error which, in cross-sectional studies, cannot be 
separated; for an extended discussion, see Gerbing & Anderson, 
1984). In testing for the factorial validity of the MBI, all 
analyses were based on covariance matrices and conducted 
separately for calibration and validation samples representing 
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teachers as a whole, and elementaryt intermediate, and 
secondary teachers as separate subsaraples* 

Somewhat surprisingly, given the present sample sizes, fit 
of the hypothesized 3-factor model (Model 1) for both teachers 
as a single group, and for elementary, intermediate, and 
secondary teachers as separate groups, was poor from both a 
statistical ( values) and a practical (AGFI, CFI, PCFI 
values) perspective; this model was therefore rejected. These 
results are summarized in Table 1. (Model 0 argues that each 
item represents a factor and therefore represents the 22-factor 
null model that is needed to compute the CFI), 



Insert Table 1 about here 



Given rejection of the initially postulated model, the next 
logical step was to take an exploratory approach to the 
analyses in order to determine reasons for the ill-fitting 
model. Of particular import was the question of (a) whether for 
each teacher group, the data were best described by a 3-factor 
structure, (b) whether across teacher groups, MBI items 
exhibited the same pattern of loadings on all factors, and (c) 
whether the deletion of aberrant items would lead to a more 
adequately specified model. To answer these questions, the data 
were reanalyzed using EFA and post hoc (i.e., respecified) CFA 
procedures . 

ERIC T S 



MBI Cross-validation 

18 

Exploratory Factor Analyses 

EFAs using raaximuni likelihood estimation were conducted for 
the entire sample of teachers, and for elementary, 
intermediate, and secondary teachers separately. Based on 
reported low to moderate correlations among the burnout 
dimensions (see Maslach & Jackson 1986), and on preliminary 
data analyses of the present data, direct oblimln oblique 
rotation was used to achieve simple structure. Taking into 
account sample size, and the case/ variable ratio, 3 cutpoint of 
.35 was used as the criterion for judging the saliency of 
factor loadings (see Gorsuch, 1983)* 

As shown in Table 2, the data, for the most part, were well 
described by a 3-factor solution both for teachers as a single 
group, and for elementary and secondary teachers considered 
separately; results for intermediate teachers were clearly less 
than optimal and are discussed in more detail later. Adequacy 
of model fit was based on statistical, as well as substantive 
considerations. Statistically, several criteria supported 
argument for a 3-factor solution. First, all but one target 
factor loading was less than .35. Second, only two non-target 
loadings were >.20, and most (total=75%; elementary«70%; 
secondary«70%) were <.10- Third, the scree plot of eigenvalues 
(Cattell, 1966) clearly delineated a 3-factor solution. 
Finally, the three factors of IE, DP, and PA accounted for 
44.5%, 44.6%, and 44.5% of the variance for teachers as a 
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whole, and for elementary, and secondary teachers, 
respectively . 



Insert Table 2 about here 



EFA results for Intermediate teachers leave this researcher 
both frustrated and puzzled. Frustration derives from the fact 
that followup factor analyses were clearly indicative of a 
3-factor solution, although the pattern of salient loadings on 
two of the three factors (EE, DP) did not represent good simple 
structure; puzzlement arises from the similar pattern (albeit 
much more pronounced) of incorrectly and cross^loaded items as 
those reported by Byrne (1990a) for a smaller sample of the 
same teacher population. Specifically, there appears to be less 
differentiation between the EE and DP factors for intermediate 
teachers; consistent across both studies were the incorrect 
loading of Item 6, and the cross-loadings of Items 8, 13, 16, 
and 20 on the DP factor. Psychomet r ically , it appears that 
these items are more ambiguous for intermediate teachers and, 
thus, are not tapping perceptions of emotional exhaustion and 
depersonalization as they were intended to do. Nonetheless, 
results for 2- and 4-factor solutions provided ample evidence 
of under- and over factor ing , respectively (see Gorsuch, 1983; 
Walkey , 1983) . 
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An important finding reported in Table 2 was the consistent 
problematic loading of Items 12 and 16 across the three groups 
of teachers. In all cases, Item 12, designed to measure PA, and 
Item 16, designed to measure EE, cross-loaded substantially 
(>.20) on the EE and DP factors, respectively. These findings 
were investigated further using the more stringent CFA 
procedures. We turn now co the results of these analyses. 
Post Hoc Confirmatory Factor Analyses 

Although the EFA findings suggest that a 3-d imensional 
burnout construct underlies the MBI, and as such, that two 
Items (#12, #16) are possibly problematic, when used with 
elementary, intermediate, and secondary school teachers, the 
true test must come from a CFA approach to the analyses. 
Indeed, it is now widely accepted that EFA is limited in its 
ability to: (a) yield unique factorial solutions, (b) define a 
testable model, (c) assess the extent to which an hypothesized 
model fits the data, and suggest alternative parameterization 
for model improvement and, (d) adequately test factorial 
invariance across groups (Bollen, 1989; Fornell, 1983; Long, 
1983; Marsh & Hocevar, 1985). In contrast, CFA can yield this 
information and is therefore a more powerful test of factorial 
validity . 

Maintaining an exploratory mode of inquiry (see Anderson & 
Gerbing, 1988), a CFA model was specified for teachers as a 
whole, and for elementary, intermediate, and secondary teachers 
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separately, that reflected EFA results in common across these 
groupings. As such, a 3-factor model was formulated that 
allowed for the cross-loadings of Items 12 and 16 on the EE and 
DP factors, respectively. For goodness-of -f i t statistics 
related to this model, we turn back to Table 1. 

To compare the difference between the EFA-speci f ied model 
(Model 2) and the initially hypothesized model (Model 1), we 
evaluate the difference in ^^X^^ between the two models. 
Since this differential is itself x^'-^^^^^i^"^^^ • with degrees 
of freedom equal to the difference in degrees of freedom (^df), 
it can be tested statistically; a significant indicates a 

substantial improvement in model fit. Reviewing the results in 
Table 1, we see evidence of a highly significant improvement in 
fit between Models 1 and 2 for the total sample, and for each 
of the teacher subsamples. Although the fit of this model 
remains somewhat less than adequate as indicated by the AGFI , 
CFI, and PCFI , it nonetheless shows much improvement over the 
initially hypothesized model. These findings fortify the 
earlier EFA findings and suggest that Items 12 and 16 may be 
inappropriate for use with educators. Having identified two 
items that were consistently aberrant across teacher groups, 
the task now was to work towards improving the MBI Educators 
Survey by delineating a core set of items that could most 
validly measure the three facets of burnout for teachers. Thus, 
a third CFA model (Model 3) was specified in which Items 12 and 
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16 were deleted (see Anderson & Gerbing). (The deletion of 
items, of course, meant respecifylng the correct null model for 
calculation of the CFI and PCFI). 

Although results reported in Table 1 demonstrated further 
significant improvement in fit, AGFI and CFI values in the 
•80*s indicated that, for each teacher group, a certain degree 
of model misfit still remained* Indeed, a review of the Mis 
revealed two abnormally large values representing error 
covariances between Items 1 and 2 (Total » 331,227; Elementary 
a 89-55; Intermediate = 68.16; Secondary = 156. 38), and 
between Items 10 and 11 (Total = 244.454; Eleinentary = 68.46; 
Intermediate » 19.84; Secondary « 139.01). Such parameters most 
often represent nonrandom measurement error due to method 
effects associated with the response format of measuring 
instruments and are therefore not unexpected in the CFA of a 
single measuring instrument. Indeed, previous research with 
psychological constructs in general (e.g., Byrne, Shavelson, & 
Muthen, 1989; Joreskog, 1982; Newcomb & Bentler, 1986; Tanaka & 
Huba, 1984), and with measuring instruments in particular 
(Byrne, 1988a, 1988b; Byrne & Schneider, 1988) has demonstrated 
the need to allow for correlated errors in order to attain a 
well-fitting model. Thus, two additional models (Models 4, 5) 
were specified in which these error terras were free to covary, 
rather than contrained equal to zero. As shown in Table 1, 
these estimated models resulted in a further substantial 
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improvement in fit for all groups. 

Although Rentier and Chou (1987) have argued that the 
specification of a model that forces such error terras to be 
uncorrelated is rarely appropriate with real data, the 
estimation of such parameters does nothing to improve the 
validity of a measuring instrument. From a psychometric 
perspective it would seem more appropriate to consider the 
deletion of one item from each of the highly correlated pairs. 
Thus, in an effort to improve the MBI for use with teachers, 
four additional models were tested in which one item for each 
highly correlated error pair was deleted. This decision was 
guided by two important considerations. First, because the Mis 
for these parameters were abnormally high compared with other 
Mis in the same n.atrix, it seemed evident that responses to 
these particular items were influenced by some overlap in item 
content. Indeed, a review of the zero-order correlations 
between these item-pairs revealed fairly strong evidence of 
this for teachers in general (Items 1 & 2 r=.75; Items 10 & 11 
r«*69), and for elementary (Items 1 & 2 r».74; Items 10 & 11 
r».69), intermediate (Items 1 & 2 r«.79; Items 10 & 11 r^.60), 
and secondary (Items 1 & 2 ra.75; Items 10 & 11 r«.69) teachers 
considered as separate groups. This information, coupled with 
evidence of exceptionally similar item content (see Appendix), 
seemed justification for the elimination of one item from each 
pair. Second, the presence of extreme Mis associated with these 
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item pairs was not only consistent across groups in the present 
study, but was also consistent with findings by Byrne (1990a) 
with respect to the similar but smaller sample of teachers 
noted earlier* Given this convergence of evidence, it seemed 
apparent that the use of these items with teachers, elicited 
responses that were masked by other irrelevant factors. 

Thus, with a view to improving the psychometric soundness 
of the Educators Survey, it now became important to determine 
which items, when deletv?d, not only maximized model fit, but 
also yielded the highest degree of predictive validity across 
validation samples; to this end. Models 6 to 9 were specified 
and tested* As indicated in Table 1, all four models led to a 
highly significant improvement in fit over Model 3 for each of 
the groups tested. It is important to note here that an 
examination of Mis indicated that, for each group of teachers, 
substantial improvement in model fit could be attained by 
freeing up additional parameters. However, since these 
parameters were group-specific, and since the intent of this 
study was to work towards improving the MBI for use with 
teachers as a professional group, there seemed little practical 
rationale for pursuing this course; given fit statistics that 
indicated a reasonable amount of the data was explained by 
Models 6 through 9, no further deletions of items were made. On 
the basis of these models, then, it appears that the deletion 
of Items 11 and 2 leads to a better-fitting factorial model for 
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teachers. Important now, however, is to see if this observation 
holds true when Model 9 is cross- validated across the split 
samples. We turn now to these cross-validation analyses. 
Cross-validation Analyses 

The proof of the pudding, so to speak, comes in the eating. 
So too, the proof of a factorial model comes in its validation 
across an independent sample. Having identified four sets of 
items that have the potential to improve the MBI for use with 
teachers, it was now appropriate to test the predictive 
accuracy of their fitted models. As noted by Bagozzi and Yi 
(1988), cross-validation is best regarded as a method for 
comparing alternative models, with the selection of those that 
most accurately approximate the underlying population 
structure . 

Of course, the most stringent assessment of competing 
models is accomplished by means of double cross-validation 
procedures (see Bagozzi & Yi, 1988; Cudeck & Browne, 1983); 
this was the approach used in the present study* As such, 
Models 3, 6, 7, 8, and 9 were estimated for the validation, as 
well as for the calibration samples. Analyses then proceeded in 
a 2-step manner. First, the parameters derived from the 
calibration sample were used for prediction in the validation 
sample, and the goodness-of ~f 1 t criteria derived. Second, these 
analyses were repeated, but with the roles of the calibration 
and validation samples reversed* For example. Model 3 was 
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estimated for the validation sample (Sample B) with the item 
measurement and factor covariance parameter estimates from the 
calibration sample (Sample A) fixed a priori; Model 3 was then 
estimated f the calibration sample with these same parameters 
fixed a priori to the values estimated for the validation 
sample (Sample B). 

With the introduction of their cross-validation index 
(CVI), Cudeck and Browne (1983) provided a means to evaluating 
cross-validation analyses within the framework of covariance 
structure models with latent variables. The CVI represents the 
discrepancy between the sample covariance matrix of one random 
half of the sample (say the calibration sample), and the 
restricted covariance matrix as imposed on the second random 
half of the sample (the validation sample). The model that 
exhibits the smallest CVI in each of the two sets of 
cross-validation analyses, is considered the one(s) with the 
highest degree of predir ve validity for a given sample. 
Results based on these procedures are presented in Table 3. 



Insert Table 3 about here 



These results indicated that Model 9, in general, had the 
greatesst predictive validity across independent samples. As 
such, they support earlier claims of best model fit, and argue 
for the deletion of Items 2, 11, 12, and 16. Results for 
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intermediate teachers presented the only exception to these 
findings; although results were consistent in showing the 
deleterious effects of Item 11. the deletion of Item 1 appeared 
to be more salient for intermediate teachers* Nonetheless, 
given the somewhat large differences in CVI's for the 
calibration and validation samples for intermediate teachers, 
more validity work with teachers at this level of the 
educational system is clearly warranted. The fact that, for 
elementary teachers, the detrimental influence of Item 10 
seemed equivalent to that of Item 11 again points to 
group-specific differences that exist for educators at 
different teaching levels. These cross-validation findings, 
together with those related to model fit strongly support the 
need for further investigation of MBI measurements and 
structure across teaching levels. 
Tests for Invariance Across Calibration and Validation Samples 
In a final check of MBI factorial validity for teachers, 
the items and underlying structure comprising Model 9 (i.e., 
the model exhibiting the greatest predictive validity) was 
tested for their invariance across calibration and validation 
samples. As such, three increasingly restrictive hypotheses 
were tested; these bore on the equivalency of (a) number of 
underlying factors, (b) item scaling units, and (c) latent 
factor relations. Because each hypothesis was nested within the 
preceding one, this cumulative testing procedure provided 
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another extremely power ful test of the MBI Educator's Survey, 

Testing for invariance involved specifying a model in 
which certain parameters were constrained equal across groups 
and then comparing that model with a less restrictive model in 
which these parameters were free to take on any value- As with 
model-fitting, the ^X^between competing models provided a basis 
for determining the tenability ^^7~the hypothesized equality 
constraints; a significant indicating noninvariance . (For < 

more extensive discussion and application of this procedure, 
see Byrne, 1989.) These results are reported in Table A. 



Insert Table A about here 



Remarkably, these findings revealed all item scaling units 
and factor covariances to be equivalent across calibration and 
validation samples. Since measuring instruments can often be 
group-specific in the way they operate (Byrne et al . , 1989), 
findings of noninvar iance would have been more expected, than 
unexpected. The fact that this was not the case, provides 
additional support for the factorial validity of a modified MBI 
instrument that does not include Items 2, 11, 12, and 16. 

Conclusion 

The present study investigated the factorial validity of 
the MBI for teachers as a single professional group, and 
separately, for elementary, intermediate, and secondary 
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educators, the three teacher populations comprising the total 
sample. Both CFA and EFA procedures were used to validate the 
hypothesized 3-factor structure. Subsequently, CFA was used to 
(a) identify scale items that deteriorated the psychometric 
soundness of the instrument for each of these groups (b) to 
determine which set of items yielded the highest degree of 
predictive validity via double cross-validation procedures, and 
(c) to test for the invariance of number of factors, item 
scaling units, and factor relations across calibration and 
validation samples based on the model yielding the greatest 
predictive accuracy. 

Although findings from the present study. In general, were 
consistent with previous factor analytic research on the MBI in 
validating a 3-factor structure, this structure was only 
modestly well-defined. Additional exploratory work revealed 
substantial improvement in model fit with the deletion of two 
items measuring emotional exhaustion (#2, #16), one measuring 
depersonalization (#11), and one measuring personal 
accomplishment (#12). These findings substantiated previous 
research that reported problematic factor loading patterns for 
these items with respect to both teachers (Belcastro et al., 
1983; Byrne, 1990a; Firaian & Blanton, 1987; Green & Walkey, 
1988), and other professional workers ( Golembiewski et al., 
1983; Powers & Gose, 1986). Ambiguous item content is believed 
attributable to the problematic fit for Items 12 and 16; 
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niulticollinearity resulting from duplicated item content was 
clearly the problem with respect to Items 2 and 11 (see Bagozzi 
& Yi, 1988; Rindskoff, 1984). Indeed, deletion of these four 
items led to a revised MBI instrument t*^^t yielded a 
significantly improved 3-factor structure that withstood 
extremely stringent tests of double cross-validation and 
factorial invariance across independent samples. 

The fact that the 3-factor MBI structure, as orlgi-.ally 
proposed by Maslach and Jackson (1986), fit less well for 
intermediate teachers, and that this finding replicated those 
reported in another study of the same population (Byrne, 1990a) 
is both disconcerting and puzzling. Both studies determined a 
3-factor, rather than either a 2-, or 4-factor solution to be 
optimal, yet both studies also found a similar pattern of 
several badly cross-loaded items. At first blush, based on the 
present study, one might be quick to point to the relatively 
small sample of intermediate teachers compared with that for 
elementary and secondary teachers as a possible explanation? 
this, however, would appear not to be the answer since in the 
other study, all samples, albeit smaller, were approximately 
equivalent in size* 

While it seems clear that certain MBI items, particularly 
those measuring emotional exhaustion and depersonalization, are 
apparently being perceived differently by intermediate 
teachers, the dilemma as to why this should be so, is an 
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intriguing one. Is it a question of personality type within the 
Jungian typology as suggested by Garden (1987), or does the 
answer lie in the organizational structure and administration 
of intermediate/ middle (i*e., junior high) schools? Given both 
the number (5) and severity of EE items c ros s- 1 oad ing onto the 
DP factor (.65 to .53), Garden's (1987) argument that, for 
certain personalities (i.e., "thinking", as opposed to 
"feeling" types as defined within Jungian theory), and for 
certain occupational settings, the Depersonalization factor 
tends not to be salient, is certainly worthy of some 
reflection, and future scientific inquiry. Garden notes that 
within the educational profession, as with other human service 
occupations, most teachers exhibit "feeling" type Jungian 
personality characteristics. Thus, it seems unlikely that 
teachers at the intermediate level should differ substantially 
from their elementary and secondary school colleagues in this 
regard. Indeed, a more reasonable explanation seems linked in 
some way to sociological factors associated with the work 
environment of intermediate teachers. The answer to this 
perplexing problem, however, must remain the task of future 
research . 

The present study has imposed the most statistically 
rigorous and powerful tests to date on the MBI in testing the 
validity of its factorial structure. On the basis of these 
findings, together with earlier studies noted above, it is 
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unquestionably clear that at least four items (#2, #11, #12, 
#16) must either be deleted or altered in terms of item content 
when the MBI is used with members of the teaching profession. 
In light of the extremely stringent analyses performed, the 
modified MBI that took these deletions into account proved 
itself to be remarkably psychometrically sound. Although 
researchers and practitioners should be cognizant of 
differential item interpretation by elementary, intermediate, 
and secondary teachers, these findings are not considered 
serious enough to deter use of the suggested revised MBI with 
educators as a single professional group, as long as the group 
does not extend to the post secondary level (see Byrne, 1990a). 
Further construct validity research that includes testing for 
the invariance of MBI measurement and structure across teaching 
levels, however, is now needed in order to either confirm or 
disconfirm this claim; this work is currently in progress by 
the present author. 
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Appendix 

Breakdown of MBI Jtems on Factors of Burnout 



Emotional Exhaustion 

1 . I feel emotionally drained from my work. 

2. i feel used up at the end of the workday. 

3. I feel fatigued when I get up in the morning and have to face another day on the 
job. 

6. Working with people all day is really a strain for me. 
8. I feel burned out from my work. 

13. I feel faistrated by my job. 

14. 1 feel I'm working too hard on my job. 

16. Working with people directly puts too much stress on me. 
20. I feel like I'm at the end of my rope. 



Depersonalization 

5. I feel 1 treat some students as if they were impersonal objects. 

10. I've become more callous toward people since I took this job. 

11. I worry that this job is hardening me emottonally. 
15. I dont care what happens to some students. 

22. I feel students blame me for some of their problems. 



Personal Accomplishment 

4. I can easily understand how my students feel about things. 

/. I deal very effectively with the problems of my students. 

9. I feel I'm positively Influencing other people's lives through my work. 

1 2. I feel very energetic. 

17. I can easily create a relaxed atmosphere with my students. 

18. I feel exhilarated after working closely with my students. 

1 9. I have accompHshed many worthwhile things In this job. 
21. In my work, I deal with emotional problems very calmly. 
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Footnote 

An Ml may be computed for each constrained parameter and indicates the 
expected decrease in if the parameter were to be relaxed; the decrease, 
however, may actually be higher. Ml values < 5.00 indicate little appreciable 
improvement In model fit. 
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Table 1 

fiimrv ??nf>"Mtorv factor Anatvtfc Raaults for a 3-FactQr Structure* 



Sawple A (Calibration) Sample 8 (VaUdation) 

Coepetfng Model df Ax^ Adf AGFI CFI PCFI Co^rfson df Ax^ Adf AGFI CFI PCFI 











(D 


« 146^) 








Total 








« 1465) 








0 


(22 (tero) 


13,713.22 


231 


— 


- — 


— 


— — 


— 


— 


14,321.87 


231 





— 





— 


— 


1 


Initial 


2,413.K 


206 


— 


— 


.83 


.84 


.75 




2,397.89 


206 





— 


.83 


.84 


.75 


2 


EFA . 


2,088.24 


204 


325.61 


2 




.86 


.76 


Z v» 1 





' - 





— 





— 


— ^ 




specification 
































0 


(20 items) 


11,757.42 


190 


— 


— 


— 











12,367.26 


190 





— 





— 


- — 


3 


Item 16, 12 


1,553.23 


167 


535,01 


37 


.87 


.88 


.77 


3 vs 2 


1,573.32 


167 





— 


.87 


.88 


.77 




deleted 
































4 


Model 3 with 


1,231.66 


166 


321.57 


1 


.90 


.91 


.80 


4 vs 3 


















correlated 


































error bett#een 


































Items 1 md 2 
































5 


Kodei 4 with 


1,029.30 


165 


202.36 


1 


.91 


.93 


.81 


5 vs 4 


















correlated 


































error between 


































Items 10 and 
































0 


11 

(18 Stem) 


9,166.14 


153 














9,793.73 


153 












6 


Model 3 with 


952.69 


132 


600.54 


35 


.91 


.91 


.79 


6 vs 3 


950.93 


132 


622.39 


35 


.91 


.92 


.80 




Items 10, 1 


































deleted 
































7 


Model 3 with 


900.50 


132 


652.73 


35 


.92 


.91 


.79 


7 vs 3 


919.84 


132 


653.48 


35 


,91 


.92 


.80 




Items 10, 2 


































deleted 
































8 


Model 3 with 


875.59 


132 


677,64 


35 


.91 


.92 


.80 


8 vs 3 


935.17 


132 


638.15 


35 


.91 


.92 


.80 




Items 11, 1 


































deleted 
































9 


Model 3 with 


831.54 


132 


721-6 


35 


.92 


.92 


.80 


9 vs 3 


905.10 


132 


668.22 


35 


.92 


.92 


.80 



Items 11, 2 
deleted 



(Q » 580) Elementary (q = 579) 



0 


{22 ft«ns} 


5,598.46 


231 














5,588.79 


231 












1 


Initial 


1,010.91 


206 






.83 


.85 


.76 




1,009.16 


206 






.83 


.85 


.76 


2 


CM 


902.86 


204 


108.05 


2 


.84 


.87 


.77 


2 vs 1 


















specif {cation 
































0 


(20 items) 


4,737.51 


190 














4,729.33 


190 












3 


iteos 16, 12 


618.52 


167 


284.34 


37 


.87 


.90 


.79 


3 vs 2 


745.20 


167 






.85 


.87 


.76 




deleted 


































Model 3 with 


532.42 


166 


86.10 


1 


.89 


.92 


.80 


4 vs 3 


















correlated 


































error between 


































Items 1 and 2 
































5 


Model 6 with 


474.87 


165 


57.55 


1 


.90 


.93 


.81 


5 vs 4 


















correlated 


































error between 


































ItesB 10 and 

11 
































0 


(18 itens) 


3,738.64 


153 














3,732.18 


153 












6 


Model 3 with 


422.72 


132 


195.80 


35 


.91 


.92 


.80 


6 vs 3 


477.53 


132 


267.67 


35 


.89 


.90 


.78 




Items 10, 1 


































delete i 
































7 


Model 3 with 


401.65 


132 


216.87 


35 


.90 


.92 


.80 


7 vs 3 


469.89 


132 


275,31 


35 


.89 


.91 


.79 




Items 10, 2 


































/fileted 
































8 


Model 3 with 


426.21 


132 


192.31 


35 


.90 


.92 


.80 


8 vs 3 


469.39 


132 


275-81 


35 


.89 


.91 


.79 




It«M 11, 1 


































deleted 
































9 


Model 3 with 


406.02 


132 


212.50 


35 


.91 


.92 


.80 


9 vs 3 


464.66 


132 


280.54 


35 


.89 


.91 


.79 



Itew 11, 2 
deleted 
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Tabl* 1 (Cont'd) 



Saaple A <CaHbrat<oo) Saa^ie 8 (VaUdation) 

Coofsetinfl Model df Adf AGFI CFl PCFt Cc^rfson %^ df Ax^ Adf ACFl CFI PCFt 













« 194) 








Intermediate 








» 194) 








0 


(22 item) 


1,985.61 


231 


- — • 


— 








— — . 




2,206.57 


231 












1 


Initial 


605.92 


206 




— 


.71 


.77 


.69 




559.63 


206 






.75 


.82 


.73 


2 




566.66 


204 


39.26 


2 


.72 


♦ 79 


*70 


2 V8 1 




• 














specification 
































0 


(20 iteiB) 


1 ,718.24 


190 















1 ,ooB.31 


190 












3 


Items 16, 12 


435.52 


167 


131 . 14 


37 


.74 


.82 


.72 


3 V8 ^ 


390 •78 


167 






. fV 


.o7 


. f 6 




deleted 
































4 


Model 3 Mith 


367.73 


166 


67.79 


1 


.78 


.87 


.76 


4 V8 3 


— 


■ 














correlated 


































error between 


































I tens 1 and 2 
































5 


Model 4 with 


350.90 


165 


16.83 


1 


.79 


.88 


.76 


5 V8 4 


















correlated 


































error between 


































Itass 10 and 
































0 


11 

(13 itoas) 


1,369.50 


153 














1,475.27 


153 












6 


Model 3 with 


298.93 


132 


136,59 


35 


.80 


.86 


.74 


6 vs 3 


277.24 


132 


113.54 


35 


.83 


.89 


.77 




Ite» 10, 1 


































deleted 
































7 


Model 3 with 


305.34 


132 


130-18 


35 


.79 


,86 


.74 


7 vs 3 


271.05 


132 


119.73 


35 


.83 


.89 


.77 




Items 10, 2 


































deleted 
































8 


Model 3 with 


307.71 


132 


127.81 


35 


.79 


.86 


.74 


8 vs 3 


248.05 


132 


142.73 


35 


.84 


.91 


.79 




Items 11, 1 


































deleted 
































9 


Model 3 with 


315-81 


132 


119.71 


35 


.79 


.85 


.73 


9 vs 3 


243.89 


132 


146.89 


35 


.85 


.92 


.80 




Items 11, 2 


































deleted 

































(Q » 692) Secondary (Q » 692) 



0 


(22 itrnw) 


6,423.41 


231 














6,656.56 


231 












1 


Initial 


1,281.20 


206 






.81 


.83 


.74 




1,209.60 


206 






.82 


.84 


.75 


2 


EFA 


1,133.74 


204 


147.46 


2 


.83 


.85 


.75 


3 vs 2 


















specification 
































0 


(20 items) 


5,553.55 


190 














5,736.33 


190 












3 


I ten 16, 12 


895.98 


167 


237.76 


37 


.84 


.86 


.76 


3 vs 2 


804.10 


167 






.86 


.89 


.78 




deleted 
































4 


Model 3 Mith 


741.83 


166 


154.15 


1 


.87 


.89 


.78 


4 vs 3 


















correlated 


































error between 


































Items 1 and 2 
































5 


Model 4 with 


626.67 


165 


115.16 


1 


.89 


.91 


.79 


5 vs 4 


















correlated 


































error between 


































Item 10 and 
































0 


11 

(18 item) 


4,265.32 


153 














4,584.13 


153 












6 


Model 3 with 


558.06 


132 


337.92 


35 


.89 


.90 


.78 


6 vs 3 


503.70 


132 


300.40 


35 


.W 


.92 


.80 




Item 10, 1 


































deleted 
































7 


Model 3 with 


525.52 


132 


370,46 


35 


.90 


.90 


-78 


7 vs 3 


503.73 


132 


300.37 


35 


.90 


.92 


.80 




Item 10, 2 


































deleted 
































8 


Model 3 with 


499.3*) 


132 


396.68 


35 


.90 


.91 


.79 


8 vs 3 


504.65 


132 


299.45 


35 


.90 


.92 


.80 




Item 11, 1 






























< 




deleted 
































9 


Model 3 with 


472.64 


132 


423.34 




.91 


.92 


.80 


9 vs 3 


505.16 


132 


298.94 




.90 


.92 


.80 




Item 11, 2 


































deleted 

































^ All ix values statUticalty significant at p < .001 
^ Itemi 16 (EE) and 12 (PA) free to load on DP and EE, respectively 
O Imoticnal Exhaustion; 1^ > Depersonalization; PA « Personal Accoa|)lislMnt 
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Table 2 



Summary Exoloratofv Factor Analvtic^for a 3-Factor Structure 

Total Elementary Intennediate Secondary 

(n = 2931) (n = 1159) (n - 388) (n - 1384) 



Target Loadings* 
(22 factor loadings) 

High .87 .87 .87 .91 

Low .30 .29 .34 .32 

Median .GO .59 .52 .60 

%£.35 95.45 95.45 68.18 95.45 



Nontarget Loadings'* 
(44 factor loadings) 

High .39 .36 .65 .42 

Low .00 .00 .00 .01 

Median .06 .05 .12 .07 

%t.35 2.27 2.27 20.45 2.27 



Factor Correlations 
(3 factor correlations) 

High .50 .53 -.46 .45 

Low -.29 -.33 -.04 -.27 

Median -.47 -.49 -.41 -.43 

%£.35 66.67 66.67 66.67 66.67 



Items with #12 (.39) #12 (.36) #12 (.37)^ #12 (.42) 

cross-loadings > .20 #16 (.23) #16 (.23) (.26) #16 (.21) 

#16 (.64) 



* Target loadings are factor loadings on the factor which the item was designed to 
measure 

^ Nontarget loadings are all other loadings 

° Additional cross-loaded items > .20 for intermediate teachers were: #20 (.65); #1 3 
(.63), (.24); #8 (.63). (.38); #3 (.44), (.46); #15 (.53); #19 (.37); #9 (.30); #18 (.30); 
#14 (.26) 
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Tabl« 3 



Cof^setine Model 



Sample A (Calibration)* 



Sample B (Validation)' 



df 



CVI 



df 



CVI 



Total 



3 


Items 16, 12 
deleted 


1620.06 


190 


1.09 


1597.59 


190 


1.07 


6 


Items 10, 1 
deleted 




153 


.67 


10(».O1 


153 


.67 


7 


Item 10, 2 
deleted 


966.81 


153 


.65 


946.37 


153 


.63 


8 


Items 11, 1 
deleted 


983.98 


153 


.66 


922.01 


153 


.62 


9 


Items 11, 2 
deleted 


9S3.K 


153 


.64 


877.87 


153 


,59 



Elementary 



ERIC 



3 


Items 16, 12 
deleted 


789.85 


190 


1.36 


663.10 


190 


1,15 


6 


Items 10, 1 
deleted 


527.34 


153 


.91 


471.56 


153 


.81 


7 


Items 10, 2 
deleted 


518.85 


153 


.90 


449.82 


153 


.78 


8 


Items 11, 1 
deleted 


516.24 


153 


.89 


472.71 


153 


.82 


9 


Items 11, 2 
deleted 


510.44 


153 


.88 


451.63 


153 


.78 









Intermediate 








3 


Items 16, 12 
deleted 


445,50 


190 


2.31 


483.51 


190 


2,51 


6 


Items 10, 1 
deleted 


314.99 


153 


1.63 


335.15 


153 


1,74 


7 


Items 10, 2 
deleted 


322.37 


153 


1.67 


342.84 


153 


1,78 


8 


Items 11, 1 
deleted 


3(»}.06 


153 


1.55 


356.93 


153 


1,85 


9 


Items 11, 2 
deleted 


353.49 


153 


1.83 


366.51 


153 


1,90 








Seconttery 








3 


Items 16, 12 
deleted 


856.77 


190 


,89 


948.01 


190 


,99 


6 


Items 10, 1 
deleted 


561.32 


153 


.81 


612.88 


153 


,89 


7 


ItsM 10, 2 
deleted 


560.20 


153 


.81 


579.22 


153 


.84 


8 


ItMS 11, 1 
deleted 


560.17 


153 


.81 


551.80 


153 


.80 


9 


Items 11, 2 
deleted 


559.94 


153 


.81 


524.46 


153 


.76 



Plodel ettiMtfon bMad on $ai^>la 8 covarfanca Mtrix; aodel apacif ication constraioi^ factor loading and 
corralation paraMtera to be fixad a priori baaad on %mpl0 A aatiMtaa 

Nodal aatfnatfon bMed on Saa^ile A covarianca natrix; nodal apacif ication ccnatrainad factor loadings and 
correlation paraMtari to ba fixad a priori based on S^qpla B aatinatas 
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Table 4 

Tests for Eojivalent Measurement and Structure Across CaHbratlon and Val ictetioo Sawoles Based on Hodet 9^ 



Cooijetfng Model 



df 



Ax' 



Adf 



1. Equivalent ruvber of factors (3) 

2. Equivalent Iteia measureo^its 

3. Equivalent iten neasurMients and 
factorial strticture 



Total 
1736.65 
1752.97 
1760.40 



264 
279 
285 



16.32 
7.43 



15 
6 



NS 
MS 



1. Equivalent number of factors (3) 

Z. Equivalent iteia measureoents 

3. Equivalent iten weasurenients and 
factorial structure 



Eleawntary 
870.68 
891.22 
893.65 



264 

279 
285 



20.54 
2.43 



15 
6 



NS 
NS 



1. Equivalent number of factors (3) 

2. Equivalent item neasurements 

3. Equivalent iten measuroiMnts and 
factorial structure 



Intermediate 
559.70 
573.93 
585.93 



264 
279 
285 



14.23 
12.00 



15 
6 



NS 
NS 



1. Equivalent rumter of factors (3) 

2. Equivalent item neasureoiants 

3. Equivalent iten raasurenents and 
factorial structi«-e 



Secondary 
977.80 
994.87 
1004.44 



264 
279 
2© 



17.07 
9.57 



15 
6 



NS 
NS 



NS ■ not 8i9nific»it 

' Items 2, 11, 12, 16 deleted 
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